Feature selection using self-information and entropy-based uncertainty measure for fuzzy neighborhood rough set
نویسندگان
چکیده
Abstract Feature selection based on the fuzzy neighborhood rough set model (FNRS) is highly popular in data mining. However, dependent function of FNRS only considers information present lower approximation decision while ignoring upper decision. This construction method may lead to loss some information. To solve this problem, paper proposes a joint entropy self-information measure (FNSIJE) and applies it feature selection. First, construct four uncertain measures variables, concept introduced into approximations from algebra view. The relationships between these their properties are discussed detail. It found that fourth measure, named tolerance self-information, has better classification performance. Second, an uncertainty been proposed Inspired by both views, FNSIJE proposed. Third, K–S test used delete features with weak distinguishing performance, which reduces dimensionality high-dimensional gene datasets, thereby reducing complexity then, forward algorithm provided. Experimental results show compared related methods, presented can select less important have higher accuracy.
منابع مشابه
Neighborhood rough set based heterogeneous feature subset selection
Feature subset selection is viewed as an important preprocessing step for pattern recognition, machine learning and data mining. Most of researches are focused on dealing with homogeneous feature selection, namely, numerical or categorical features. In this paper, we introduce a neighborhood rough set model to deal with the problem of heterogeneous feature subset selection. As the classical rou...
متن کاملFeature subset selection based on fuzzy neighborhood rough sets
Rough set theory has been extensively discussed in machine learning and pattern recognition. It provides us another important theoretical tool for feature selection. In this paper, we construct a novel rough set model for feature subset selection. First, we define the fuzzy decision of a sample by using the concept of fuzzy neighborhood. A parameterized fuzzy relation is introduced to character...
متن کاملA New Hybrid Framework for Filter based Feature Selection using Information Gain and Symmetric Uncertainty (TECHNICAL NOTE)
Feature selection is a pre-processing technique used for eliminating the irrelevant and redundant features which results in enhancing the performance of the classifiers. When a dataset contains more irrelevant and redundant features, it fails to increase the accuracy and also reduces the performance of the classifiers. To avoid them, this paper presents a new hybrid feature selection method usi...
متن کاملComparing Fuzzy-Rough and Fuzzy Entropy-assisted Fuzzy-Rough Feature Selection
Feature Selection (FS) methods based on fuzzy-rough set theory (FRFS) have employed the dependency function to guide the FS process with much success. More recently a method has been developed which uses fuzzy-entropy [9] to perform this task. Such use of fuzzy-entropy as an evaluation measure in fuzzy-rough feature selection can result in smaller subset sizes than those obtained through FRFS a...
متن کاملApplication of Fuzzy-rough Set Theory for Feature Subset Selection
Fuzzy Set Theory and Rough Set Theory are the most popular mathematical tools for dealing with uncertainties. During past decades, these set theories are being applied successfully in several areas for solving many complex tasks. This paper is concerned with the application of hybrid Fuzzy-Rough set based approach for feature subset selection. Keywords— Fuzzy set theory, Rough Set theory, Fuzzy...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Complex & Intelligent Systems
سال: 2021
ISSN: ['2198-6053', '2199-4536']
DOI: https://doi.org/10.1007/s40747-021-00356-3